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Abstract 

We introduce a growing network evolution model with nodal attributes. The 
model describes the interactions between potentially violent V and non- 
violent agents who have different affinities in establishing connections 
within their own population versus between the populations. The model 
is able to generate all stable triads observed in real social systems. In the 
framework of rate equations theory, we employ the mean-field approxima- 
tion to derive analytical expressions of the degree distribution and the local 
clustering coefficient for each type of nodes. Analytical derivations agree 
well with numerical simulation results. The assortativity of the potentially 
violent network qualitatively resembles the connectivity pattern in terrorist 
networks that was recently reported. The assortativity of the network driven 
by aggression shows clearly different behavior than the assortativity of the 
networks with connections of non-aggressive nature in agreement with recent 
empirical results of an online social system. 
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1. Introduction 



Complex networks have been the focus of the study of dynamical 
ties of complex systems in nature and society in the last decade [l|, 
Usually all nodes are assumed to belong to one population or class, and 
the interactions between two distinct populations have been reported just 
recently js], lof. The dynamical properties of two interacting populations, ex- 
troverts and introverts were recently studied using dynamical network evo- 
lution model [5]. Buldyrev et al joj developed a framework for understand- 
ing the robustness of interacting networks. Using the generating functions 
method they present the exact analytical solutions for the critical fraction 
of nodes, which upon removal, will lead to a complete fragmentation into 
interdependent networks. 

The focus of the current approach in understanding the formation and 
evolution of terrorist networks is on middle-range perspective as opposed to 
the micro-level approach that considers individual terrorist and macro-level 
analysis of the root causes of terrorism fi\. In particular, the interest is in 
placing the relationships between individuals in the context of (i) their in- 
teractions with each other, (ii) how they are influenced by ideas originating 
from their environment, (iii) their interactions with people and organizations 
outside of their group |8|. Motivated by this description which conceptually 
refers to a complex system and because networks provide a fruitful frame- 
work to model complex systems jof, we introduce a network model that aims 
to describe the interactions between potentially terrorist and non-terrorist 
populations. 

Our goal is to present a model of a social network that contains two types 
of agents which demonstrate different affinities in establishing connections 
within their own population versus connections with the other population. 
We also aim that the model is simple enough to allow the derivation of ap- 
proximate analytical expressions of the basic characteristics of the network. 
The latter can be achieved in the general framework of rate equation theory in 
the mean field approximation which has been introduced to study fundamen- 



tal characteristics of growing network models [10|, . Within this framework 



we introduce the rate equations that are specific to our model, solve them 
and obtain analytical expressions that predict the growth dynamics of the 
degree of individual vertices. 

While the probability to create initial connections is easily defined, the 
probability for the secondary contacts between nodes is difficult to derive 
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Figure 1: Possible combinations of connections between a new node v, the initial contact 
j and the secondary contact i. (a)-(d) show the connections when the new node is an N 
node, (e)-(h) show the connections when the new node is a V node. Empty symbols (o) 
mark an N node while full symbols (•) mark a V node. Full lines represent links between 
nodes of the same type, e.g. friends, such as NN or VV. Dashed lines represent links 
between nodes of different types such as NV, e.g. enemies. 

because of the interactions between the two types of nodes. As a first ap- 
proximation, we assume that the initial and secondary contacts form edges 
with the same probabihties. To include a more precise contribution of both 
the initial and the secondary contacts in the rate equations we empirically 
obtain their functional dependences. This leads to an improved agreement 
between analytical and numerical simulation results. 

From the functional dependence of the degree of a node as a function 
of time using the mean-field arguments, we derive the degree distribution 
for each of the two types of nodes. We also derive analytical expressions of 
the structural, three-point correlations between nodes to study the clustering 
properties of the networks. 

2. Model 

The network models are broadly classified into two categories: the net- 
work evolution models in which addition of new edges depends on the local 
structure of the network, and nodal attribute models in which the existence 
of edges is determined solely by the attributes of the nodes (for review see 
(lil). The network evolution models can be further categorized into growing 
network evolution models and dynamical network evolution models. In the 
former the network growth starts with a small seed network and nodes and 



3 



links are added according to specific rules until the network reaches a pre- 
determined size. Dynamical network evolution models start with an empty 
network and edges are added and deleted according to specific rules until 
statistical properties of the network stabilize. 

In this paper, we propose a model that incorporates a growing network 
evolution process with nodal attributes which could be thought of as a new 
class of model. The model has five free parameters, three describing the 
growing network evolution process and two describing nodal attributes. The 
parameters of the growing network evolution process are the number of nodes 
A^, the average number of nodes selected at random as initial contacts rrir, 
and the average number of nodes selected as secondary contacts among 



the neighbors of each initial contact [13|. Two parameters quantify the type 



and amount of nodes, non-violent N or potentially violent V nodes and the 
type of interactions between them which SbTQ clS follows: (i) nodes are ran- 
domly marked as no n- violent with probability pn and potentially violent 
V with probability = ^~Pn] (h) when establishing initial contacts, nodes 
connect with probability Ps if the nodes are of the same type, such as V 
with ^ or with A^, or with probability = 1 — if the nodes are of 
different types such as with V nodes. The secondary contacts are estab- 
lished with nodes among the neighbors of the initial contacts. The model 
combines the random attachment of initial contacts with the implicit prefer- 
ential attachment of the secondary contacts. In that, the model represents a 
generalization with two types of node attributes of growing models such as 



]J,ll5|,ll3 



The definition of establishing edges can be thought of as creating links 
between friends (solid lines in Fig. [1]) or between enemies (dashed lines). By 
varying the value of Ps, we can generate different strengths of friendliness or 
animosity. The possible configurations of triads that arise in social networks 
in such a context are: (a) three friendly interactions; (b) one friendly and 
two unfriendly connections; (c) two friendly interactions and one unfriendly; 
(d) three unfriendly interactions j^. According to the strong formulation of 
structural balance theory in social sciences, configurations (a) and (b) are 
considered stable while (c) and (d) are unstable and likely to break apart 
(l6i] . In their empirical large-scale verification of the long-standing structural 
balance theory, the authors of j^H find that the unstable triads, especially 
formation (c) are extremely underrepresented in an online social system in 
comparison to a null model. Our model produces correctly the stable configu- 
rations (a) (Fig. [T^,e) and (b) (Fig. [T)D-d,f-h) but cannot produce the unstable 
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configurations (c) and (d) in accordance with the strong formulation of the 
structural balance theory (lif . 

The model algorithm consists of the following steps: (1) start with a seed 
network of uq connected nodes among which some are N and some are V, 
depending on p^; (2) at each time step add a new node, which has probabil- 
ity pn to be a non- violent and py = 1 — pat to be a violent node; (3) select on 
average > 1 random nodes as initial contacts. The probability to connect 
the same type of nodes is ps while = 1 —ps is the probability for initial con- 
tacts if they are of different types. (4) select on average m<j > nodes among 
the neighbors of each initial contact as secondary contacts. Connecting the 
new node with the secondary contacts is done without checking if it is the 
same type of node or not. There are two reasons for this choice, (i) Because 
the probability to establish inter-population connections [ps) is higher than 
the probability to connect nodes intra-population (pd), it is more likely that 
the first contact and its neighbors (potential secondary contacts) are of the 
same type than of different types. Therefore the secondary contacts will be 
more likely to be intra-population contacts even without explicitly modifying 
their probability to connect based on the type, (ii) The secondary contacts 
are meant to mimic the 'friend-of-a-friend' type of contacts in the real world, 
and we think that the implicit preferences given by the existing network con- 
nections would more accurately describe the nature of such contacts without 
including an explicit separate probability. Apply steps (2) to (4) until the 
network reaches the necessary size. 

2.1. Rate Equations 

We start with constructing the rate equations that describe the change of 
the degree of a node on average during one time step of the network growth 
process for each of the non-violent and violent V nodes. The degree of a 
node grows via two processes. One is the random attachment of connecting 
a new node to nodes that are its initial contacts. The second process is 
when the new node is further connected to the nodes among the neighbors 
of the initial contacts. In the following we assume that the probability of this 
second process is linear with respect to the degree of the node which leads 
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to implicit preferential attachment. The rate equations are: 
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where ki is the degree of node i and we assumed that kis = kiPs and kid = hpd. 

All possible combinations of triads of a new node f , the initial contact 
j, and the secondary contact i are schematically shown in Fig. [U^a-h) and 
presented by the third through the sixth terms in Eq. ([1]) and Eq. ([2]) for an 
N and V node, respectively. 

For example, the fifth term in Eq. ([1]) describes the rate of change of the 
degree of vertex i (which is an node) due to establishing the configuration 
of contacts shown in Fig. [lb. The fifth term contains four factors. The first 
factor is the average number of secondary contacts which is rrirms- pv is 
the probability that the new node created at time step t \s Sl V node, ps 
is the probability that the newly created node connects to the initial con- 
tact which is a node of the same type. Finally, ki^/ yy kj + J^j nv ^i) 
the probability that the node selected for initial contact (node j in Fig. dt) 
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shares an edge with the node i. This is a standard expression for preferential 
attachment except for the comphcations induced by having two distinct pop- 
ulations. The nominator kid is the degree of the i node if we count only the 
links to different types of nodes (in this case V nodes). The denominator is 
the sum of all possible links which the type of node selected for initial contact 
(in this case V) could have. Assuming that the initial and secondary con- 
tacts are created with the same probabilities the nominator would be equal to 
kid = kiPd and we will approximate it in this way and the denominator would 
be equal to {pvPs + Pd)2mr(l + ms)t. We will use this expression as a first 
approximation and we will also empirically derive functional dependences of 
the denominator and compare the results. 

Note that the number of edges, multiplied by two, that exist between N 
and N nodes is equal to kj and include both edges created as initial 

contacts and edges created as secondary contacts. The same is true for the 
number of edges between V and V nodes which is yy kj, and the number 
of edges between N and V nodes which is kj in Eqs. ([1]) and ([2]). We 

know that the probabilities for creating edges as initial contacts are Ps or 
Pd if between same type of nodes or different types of nodes, respectively. 
We do not know, however, what these probabilities are when the edges rep- 
resent connections to secondary contacts. This is the reason to express the 
respective summations by the following relations: 

'^kj = g{pN{pN,Ps))'2^mr{l + ms)t (3a) 

j,NN 

kj = h{pv{pv,Ps))'^mr{l + ms)t (3b) 

j,vv 

^ kj = q{pd{Pd))'^ mr{l + ms)t. (3c) 

j,NV 

Eqs. ([3]) contain the term 2 7/1^(1 + ms)t because there are ~ t vertices at time 
t and mril + m^) is the average initial degree of a vertex. g{pN{PN,Ps)) in 
Eq. (15al) is the probability that the edge is between N nodes. h{pv{pv,Ps)) 
Eq. pb|l is the probability that the edge is between V nodes, and q{pd{Pd)) 
Eq. (I3c|) is the probability that the edge is between nodes of different types. 
Functions g, h, and q contain edges established due to both initial and sec- 
ondary contacts, whose contributions cannot be separated and derived an- 
alytically. Therefore, we will obtain these functional dependences through 
empirical considerations. If we assume that the edges to secondary contacts 
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are established with the same probabihties as the edges to initial contacts, 
then the relations would have been: 



9{Pn{pn,Ps)) = PnPs (4a) 
h{pviPv,Ps)) = PvPs (4b) 
(}{Pd{Pd)) = Pd- (4c) 

2.2. Solutions of the Rate Equations 

After separating the variables and integrating the rate equation of the 
degree /cj(7v) of N node Eq.(IT]) from kinu to ki and from ti to t we obtain the 
following expression for the degree as a function of time ki{t) 

hN){t) = GJ-j -G2, (5) 

where 

^ _^ f PnpI + PvPsPd ^ PnpI + PvPsPd \ V 

^ V 9{Pn) + q{pd) Kpv) + q{pd) ) 

G2 = A^Gi (6b) 

G3 = G2 + kinit (6c) 

C = 2(1 + ms)/ms (6d) 

An = rrir {pnPs + PvPd) /pn (6e) 

kinit = mr{l + ms). (6f) 

Integrating the rate equation of the degree ki of V nodes, Eq.([2]) produces 
the time dependence of the degree of any V node 



where 



k,^y){t) = H,[-] -H2, (7) 



jj^^Q f PvPI + PNPsPd ^ PvpI + PNPsPd \ ~ ^g^^ 

^ V h{pv) + q{pd) g{pN) + q{pd) J 

H2 = AyH, (8b) 

H3 = H2 + kinit (8c) 

Av = nir {pvPs + PNPd) Ipv- (8d) 
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If we assume that the edges to secondary contacts are estabhshed with the 
same probabihties as the edges to initial contacts, then the solution of the 
rate equation Eq. ([1]) will be the following for an N node 

/ 1 \ ^l""^ 

k,^^^{t) = DJ-) (9) 



where 



C^ - C ( ^ PvP^Pd I PnpI + PvPsPd \ ^^Q^^ 

^ V PnPs + Pd PvPs +Pd J 

D2 = AnDi (10b) 

^3 = ^2 + Knit, (lOc) 



and for a V node: 



where 



Kiy){t)=E;[-] (11) 



E, - C ( ^^^^ ^ PNPsPd ^ Pvpj + PNPsPd \ ,^2a) 

^ V PvPs + Pd PnPs +Pd J 

E2 = AyEi (12b) 
^3 = ^2 + kiniU (12c) 

after making use of Eqs. (jlj). 

2.3. Degree distribution 

In the mean field approximation, the degree fcj(t) of a node i evolves with 
time t strictly monotonically after the node was added to the network at 
time tj. Therefore, the nodes added to the network more recently will have 
on average lower degree than those added to the network earlier. Assuming 
that we add nodes to the network at equal intervals, the probability density of 
ti is 1/t. Using the properties of cumulative probability distribution function, 
we can write that the probability of a node to have degree k < ki is equal to 
the probability that the node has been added to the network at time t > t^ 

F{k) = P{k < ki) = P{i > ti) = (13) 
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We can derive the probability density function of N nodes, PN{k) by obtain- 
ing an expression for t from Eq. (jSj) , then replacing it in Eq. f[T^ and differ- 
entiating the resultant equation with respect to fcj, which is P{k) = dF/dk. 
The result is: 

P^(k) = Gi(G2 + ky^'-'Gf\ (14) 

Similarly, we can derive the probability density function of V nodes, Pv{k) 
by obtaining an expression for t-^'' from Eq. ([7]), then replacing it in Eq. (1T3|) 
and differentiating the resultant equation with respect to ki to obtain: 

Pvik) = H,iH, + k)-^^-^H^\ (15) 

If we use Eqs. ([9]) and ffTTj) to derive expressions for t-^-* and t^^\ respectively, 
then the degree distributions are presented by 

PM{k) = D,{D^ + k)-''^-^D^^ (16) 

Py{k) = E,{E, + k)-''--'E^\ (17) 

It should be noted that hi and all quantities are expectation values and can be 
compared to simulation results which are assemble averages. The analytical 



results converge to those reported in Ref. [13| in the limit of one population 



which means pa? = 1, py = 0, = 1, and pd = 0. 

2.4- Clustering Characteristics 

The dependence of the clustering coefficient as a function of the degree of 
a node can be derived using the rate equation method llll . The number 
of triangles -Ei(Ar) (-E'i(v')) around a node if z is an iV {V) node is changing with 
time following two processes. The first process is when node i is selected as 
one of the initial contacts with probability Aj^/t {Ay/t) and the new node 
links to some of its neighbors which are on average. The second process 
is when node i is selected as a secondary contact and a triangle is formed 
between the new node, the initial contact, and the secondary contact. It 
is possible that two neighboring initial contacts and the new node form a 
triangle, but the contribution of this process is negligible. The rate equation 
for the number of connections between the nearest neighbors of a node of 
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degree kny (kiv) is given by 
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respectively. After some algebra and using Eq. ([T]) if ? is an node and 
Eq. ([2]) if i is a node we obtain 



dt 



dt 
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(20) 



(21) 



dt dt ' t 

After integrating both sides of Eqs. (!20l) and (12T1) with respect to t, using the 
initial condition Ei(^N){kiniuti) = Ei(y){hnit,ti) = rrirms, and hnu given by 
Eq. (l6fj) . we obtain the expressions for the change with time of the number 
of connections between the nearest neighbors of a node of degree h^ 



Ei{N) = h-rrir + {rus - I) An In 



(22) 



11 



if z is an node and of a node of degree kiv 

Eiiv) = ki-mr + {rris - I) Ay In (23) 

if z is a \^ node. We use Eqs. ffT^ and ffT^ to obtain expressions for ln(t/tj) 
and insert them in Eqs. fl22]) and fl25]) . respectively. Finally, the degree- 
dependent clustering coefficient C{k) as a function of the degree k of the 
node which is also referred as the clustering spectrum, is given by 

C^(fc) - 2 (24) 

for an node and by 

fc - + (m, - l)AvH^ In [(fc + i/^) /H-,] 
Cy{k) = 2 (25) 

for a V node, where we make use of C{k) = 2E{k)/[k{k — 1)], which defines 
the clustering coefficient of a vertex as the ratio of the total number of ex- 
isting connections between all k of its neighbors and the number k{k — l)/2 
of all possible connections between them. The degree- dependent clustering 
coefficient C{k) defines the local clustering properties of the network. 

The global clustering characteristics of a network include the mean clus- 
tering coefficient C as averaged over the vertex degree, the mean clustering 
< C >= i/nJ2iCi as averaged over the nodes of the network (where Cj 
is the clustering coefficient of node i), and the so-called transitivity C 
Making use of degree- dependent local clustering coefficient (Eqs. (|2^ and 
fl25|l ) and the degree distribution (Eqs. f|T6|) and f|T71) ) for or \^ node one 
can define the respective mean clustering coefficient as: 

C = J2mC{k). (26) 

k 

Transitivity is a measure of the ratio of the total number of loops of length 
three in a graph to the total number of connected triples and is defined as 

E,P{k)k{k -l)/2- ^ > 

The mean clustering coefficient C and the transitivity C assess in a different 
manner the clustering properties of a graph. In real networks they could 



have very different values for the same network [18 
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3. Comparison between Theory and Simulation Results 

We compare three outputs: the analytical derivation of degree distribu- 
tion P{k) (Eqs. flTB]) and f[T7l) ) that was obtained assuming that the links to 
secondary contacts are established with the same probabilities as the links 
to initial contacts Eq. (jl]), the derivation of P{k) (Eqs. (fT^ and ( !T5|) ) ob- 
tained by using functional dependences Eq. ([3]), and the numerical simulation 
results. 

3.1. Functional dependence 

First, let us focus on the functional dependence of the probability that 
the edge is between N nodes q^Pn) on Pn (Eq. fl3a|) ). which represents the 
combined probabilities that the edge is between same type of nodes Ps and 
the probability that they are nodes. We empirically estimate g{pN{PN,Ps)) 
by calculating the number of A^A^ edges, multiplying it by two and dividing 
it by the average number of nodes at time t which is 2/71^(1 + ms)t. We 
obtain the functional dependence of empirically estimated g{pN) by a matrix 
multiplication of two vectors; one is the vector of g{pN) values as a function 
of Pn at fixed ps = 0.7 and the other is the vector of q^Pn) values as a 
function of ps at fixed pn = 0.8. Next, we aim to construct a function of pn 
and Ps, pn{pn,Ps) such that the empirically estimated g{pN{,PN,Ps)) which 
express the probabilities for both initial and secondary contacts is a linear 
function of pjy. We plot the result for Case I (one initial contact and two 
secondary contacts) in Fig. [2^ for p^ G [0.5, 0.9] at fixed values of Ps = 0.7 
and for ps G [0.5,0.9] at fixed values of pn = 0.8 (circles Q)- Squares (□) 
Fig. [2^ mark results for p^ G [0.5, 0.9] at fixed values of Ps = 0.5 and for 
Ps G [0.5, 0.9] at fixed values of pn = 0.5. We obtain that the combined 
probability Pn{pn,Ps) of the form 

Pn{PN,Ps) =PN{Cpr,+Ps{l-Cp^)), (28) 

produces the following least-squares linear fit 

gipN) = 0MpN-0.10±AN (O) (29a) 
g{pN) = 0.29pN - 0.04 ± Ay (□) (29b) 

for Cp^ = 0.8 (O) and Cp^ = 0.5 (□), respectively. 

The prediction error estimate was generated for g{pN) and found to be 
Ajv = 0.0221 (for Q) and = 0.0095 and (for □) which allows us to obtain 
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Figure 2: (a) Probability to establish both initial and secondary contacts between A'' nodes 

g{pN{PN,Ps)) as a function of pn{pn,Ps) = Pnicpf, - Cp^.)), where Cp„ = 0.8 for 

Pn = 0.8 (O) and Cp„ = 0.5 for pt^ = 0.5 (□). (b) Probability to establish both initial 
and secondary contacts between V nodes h{pv{pv,Ps)) as a function of pv{PViPs) = 
Pv{cpv + Ps(l - Cpv))i where Cp^ = 1 - Cp^ = 0.2 for pv = 0.2 (Q) and Cp^ = 0.5 for 
Pv = 0.5 (□). (c) Probability to establish both initial and secondary contacts between 
different types of nodes as a function of Pd{Pd) = Pd/{Pd + 2) for pd € [0.1,0.5] at fixed 
Pn = 0.8 (0)j and fixed pM = 0.5 (□), respectively. Values on y-axis in (a) represent 
matrix multiplication of the vector of g{pNipN,Ps)) as a function of pn G [0.5,0.9] at 
fixed Ps = 0.7 (O) {Ps = 0.5 (□)) multiplied by the vector of normalized number of NN 
edges as a function oi Ps £ [0.5,0.9] at fixed pN = 0.8 (Q) {pn = 0.5 (□)). Values on 
y-axis in (b) represent matrix multiplication of the vector of h{pv {pv , Ps)) as a function of 
Pv G [0.1, 0.5] at fixed Ps = 0.7 (Q) {Ps = 0.5 (□)) multiplied by the vector of normalized 
number of VV edges as a function of ps G [0.5, 0.9] at fixed pv = 0.2 (Q) {pv = 0.5 (□)). 
Case I: one node as initial contact and two nodes as secondary contacts. 
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a range of values for q^Pn) limited by q^Pn) ± ^n- We use these functional 
dependences within their range to express g{pN{PN,Ps)) in the solutions of 
the rate equations Eqs. ([5]) and ([7]) and in the expression of respective degree 
distributions and clustering coefficients. 

Applying the same reasoning, we obtain the combined probability 
h{pv{pv,Ps)) as a function of pv which represents both the initial and sec- 
ondary contacts. Results are shown in Fig. [2)3 for pv G [0.1,0.5] at fixed 
Ps = 0.7 and ps G [0.5, 0.9] at fixed pv = 0.2 (O)- Squares (□) mark results 
for Pv G [0.1, 0.5] at fixed ps = 0.5 and ps G [0.5, 0.9] at fixed pv = 0.5. For 
an argument of the form 

Pv{Pv,Ps) = Pvicpy +Ps{i -Cpv))i (30) 
the least-square fit produces 

h{pv) = 0.npv±Av (O) (31a) 
h{pv) = 0.20 Pv -0.01 ±Av (□) (31b) 

for Cpy = 1 — Cpjy = 0.2 (O) and Cpy = 0.5 (□), respectively. The prediction 
error estimate defines the range of values for /i(py) ± Ay, where Ay = 0.0056 
(for O) and Ay = 0.0039 (for □). 

We obtain the probability q{pd{Pd)) for creating a link between different 
type of nodes both as initial and as secondary contacts to be 

g(p,) = 1.60prf + 0.1±Arf (O) (32a) 
g(p,) = 1.86p, + 0.15 ± A, (□). (32b) 

The dependence of q{pd{Pd)) as a function of pd is shown in Fig. ^ for 
Pd G [0.5, 0.9] at fixed pn = 0.8 (Q) and pn = 0.5 (□). We applied the linear 
least-square fit for an argument of the form 

Pd{Pd) = Pd/{Pd + 2). (33) 

The prediction error estimate is obtained to be A^ = 0.0207 (for Q) and 
Ad = 0.0265 (for □). 

We use the above parameterization procedures of the probabilities to es- 
tablish both initial and secondary contacts between N nodes g{pN{,PN,Ps)), 
between V nodes h{pv{pv, Ps)), and between different types of nodes q{pd{Pd)) 
in obtaining the degree distribution (Eqs. f|T^ and f|T5|) ) and clustering co- 
efficient (Eqs. fl2^ and fl25]) ) for each of the cases considered below. 
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Figure 3: Degree distribution for two populations consisting of 80% N nodes (o) pjy — 0.8 
and 20% V nodes pv = 0.2 (•). Edges are formed with probability Ps — 0.7 if they connect 
TV with N 01 V with V nodes and with probability pd = 0.3 if they connect N with V 
nodes. Symbols represent results from numerical simulations done on a network with 
10^ agents and averaged over 100 runs. Lines represent the results of degree distribution 
P{k) given by Eq. (fT4|) for N nodes (solid) and by Eq. (fTSj) for V nodes (dashed) using 
functional dependences shown in Fig. [5] to obtain functions ± Ajv, /Oy ± Ay, and 
Pd ± Ad- The set of two lines correspond to using or —A^, for each of p,^, where 

v — N,V, d. The dash-dotted line represents analytical distribution P{k) obtained using 
Eq. for N nodes. The dotted line represents analytical distribution P{k) obtained 
using Eq. pT|) for V nodes, (a) Case I: one node as initial contact and two nodes as 
secondary contacts; (b) Case II: one node as initial contact with probability 0.9 and two 
nodes as initial contacts with probability 0.1; the number of nodes as secondary contacts 
is from uniform distribution t/[0, 3]; (c) Case III: two nodes as initial contacts; the number 
of nodes as secondary contacts is from uniform distribution [/[O, 2]. 
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3.2. Degree distribution 

The analytical distribution P{k) is given by Eq. (fT6|) for nodes and 
by Eq. ( 1T7|) for V nodes and plotted in Fig. [3] with dash-dotted and dotted 
lines, respectively. The degree distribution P{k) obtained using functional 
dependences is given by Eq. f|T^ for nodes and by Eq. f|T5|) for V nodes 
and plotted in Fig. [3] with solid and dashed lines, respectively. Simulations 
are conducted on a network with 10^ nodes starting with a seed network of 
8 nodes and are averaged over a 100 runs. All three cases considered are for 
value of the probability to create an N node p^r = 0.8 and for the value of 
the probability to establish a link between the same type of nodes ps = 0.7. 
To touch upon the versatility of the model we consider three different cases. 
They are Case I: one node as initial contact rrir = 1 and two nodes as 
secondary contacts = 2 (Fig. Case II: one node as initial contact 
with probability 0.9 and two nodes as initial contacts with probability 0.1, 
which gives rUr = 1-1; the number of nodes as secondary contacts is from 
uniform distribution f/[0, 3] and therefore, rrig = 1.5 (Fig. [3]d); Case III: two 
nodes as initial contacts rrir = 2; the number of nodes as secondary contacts 
is from uniform distribution f/[0, 2], = 1 (Fig. [St). Results demonstrate 
that for all cases considered the simulations compare relatively well with the 
analytical derivation of the degree distribution even though using functional 
dependences in P{k) derivation improves the agreement within the limits of 
the simulations. 

3.3. Clustering 

The simulation results for the clustering coefficient as a function of the 
degree of the node C{k) for and V nodes for Case III are plotted in 
Fig. m with empty and full circles, respectively. The analytical solution for 
clustering coefficient C{k) using Eq. for N nodes and Eq. for V 
nodes are plotted with lines which coincide with each other. A clear C{k) ~ 
1/k trend is observed which indicates the hierarchy in the system. 

The global clustering properties of the network are assessed by the mean 
clustering coefficient C (Eq. (126 p ) which is averaged over vertex degree, and 
the transitivity C (Eq. We study how C and C change as a function 

of pn for a fixed value of ps (plotted in Fig. |5K,c) and as a function of ps for 
a fixed value of pn (plotted in Fig. |5]3,d). The mean clustering coefficient C 
for Case I (circles in Fig. |5^) has values in the range between 0.41 and 0.43 as 
a function of p^ for both N and V nodes for fixed value of Ps = 0.7. In both 
Case II (squares) and Case III (diamonds), where the number of secondary 
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Figure 4: Clustering coefficient as a function of the degree of the node for N nodes (o) 
and V nodes (•) for pN — 0.8 and Ps = 0.7. Symbols represent results from numerical 
simulations. Lines (indistinguishable) represent the results of theoretical derivation of 
clustering coefhcient as a function of the degree of the node using Eq. (|24|) for N nodes 
and Eq. (|25|) for V nodes. Case III: two nodes as initial contacts; the number of nodes as 
secondary contacts is from uniform distribution C/[0, 2]. 

contacts is drawn from a uniform distribution, the C{pn) for and V nodes 
is symmetrical with respect to its value at pn = 0.5. Values of C for and 

V nodes as a function of ps (Fig. [5b) demonstrate a tendency to converge for 
Ps approaching one. 

The transitivity C (Fig. [5t) as a function of p^ shows a symmetrical 
pattern for N and V nodes with respect to its value at p^ = 0.5 similar to C 
behavior but with a wider difference between the results for and V nodes 
and different values. As the probability to establish a link between same 
type of nodes ps increases the values of transitivity C (Fig. [5li) for and 

V nodes converge. Higher values of both C and C among the three cases 
are obtained for Case II when there is an option to create either one or two 
initial contacts and the number of secondary contacts vary as well, e.g. in 
t/[0,3]. 

3.4- Assortativity 

The assortative properties of the network describe the degree correlation 
of the nodes at the ends of an edge and are quantified by the Pearson corre- 
lation coefficient r. The network is said to be assortative when high degree 
nodes tend to connect to other high-degree nodes and r G (0,1] [l9|. The 
network is characterized by disassortative mixing when high degree nodes 
tend to connect to low degree nodes and r G [—1,0). We calculate r of each 
of A^A^, VV , and A^V^ networks obtained for a fixed value of pn = 0.8 and 
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Figure 5: Values of the mean clustering coefficient C (Eq. ([26]) (a) as a function of the 
probability to create an N node pN for a fixed value of the probability to create a link 
between same type of nodes ps — 0.7 and (b) as a function of Ps for a fixed value of 
Pn = 0.8 for N nodes (empty symbols) and V nodes (full symbols). (c,d) Same as (a,b) 
but for the values of transitivity C (Eq. (I?fl) . Circles {No/V») represent Case I. Squares 
{ND/VM) represent Case II and diamonds {N()/V^) mark Case III. Cases I, II, and III 
are as defined in Fig. 3. 
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Figure 6: Values of assort ativity coefficient r as a fimction of the probability to create a 
link between same type of nodes Ps for a fixed value of pn = 0.8 of (a) NN network; (b) 
VV network; (c) NV network. Circles (•) represent Case I. Squares (■) represent Case 
II and diamonds (♦) mark Case III. Cases I, II, and III are as defined in Fig. 3. 
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plot the r- dependence as a function of Ps in Fig. for the three cases 
considered. A^A^ and NV networks of Case I (•) show shght disassortative 
mixing owing to the two initial contacts and one secondary contact. Increas- 
ing the probability ps to create an edge between the same type of nodes for 
Case I (• in Fig. [6]d) results in changing from disassortative for ps < 0.5 to 
slightly assortative for ps > 0.5. Varying the number of initial and secondary 
contacts in Case II (■) and III (♦) produces assortatively mixed A^A^ and 
VV networks (Fig. [6K,b). The assortativity coefficient rw of the VV net- 
works (■ and ♦ in Fig. [6]d) increases linearly with increasing the probability 
to create a link between the same type of nodes reaching a plateau around 
Ps ~ 0.7 and then decreasing for p^ = 0.9 probably because all available 
nodes are already connected. The values of ryy are larger versus rj^^ be- 
cause of the smaller number [pv = 1 — Pn = 0.2) of V nodes available for 
contacts. A recent visualization of the connections in a terrorist network 
such as the Global Salafi Jihad depicts them to form an assortative network 



20| . For all three cases the NV networks (Fig. [6t) show disassortative mix- 
ing of their degree only except for very high values of Ps = 0.8 and 0.9 for 
Case III (♦). A recent empirical study of an online social system reports 
that relationships driven by aggression lead to markedly different systemic 
characteristics than relations of a non-aggressive nature |21j. Assortativity 
is a characteristic of global properties of the system. In agreement with the 
empirical findings the assortativity of A^V^ network (Fig. Et) produced by our 
model which represents relationships driven by aggression is clearly different 
from the assortativity of A^A^ and VV networks (Fig. |6K,b) which are driven 
by non-aggressive relationships. 



4. Conclusions 

We introduced a model intended to characterize the interactions between 
two distinct populations, which form links more easily within their group 
than between groups. We aim to describe the interactions of potentially 
violent terrorist groups within the context of a largely non-violent popula- 
tion, although the same model could, in principle, be applied to other non- 
mainstream social groups. The model is kept simple enough so that analytical 
solutions could be derived and compared with empirical parameterizations 
and numerical simulation results. 

The model produces networks with relatively high mean clustering co- 
efficient C and transitivity C. Their values vary with the balance between 
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the initial and secondary contacts. This is expected because of the inter- 
play between the random and preferential attachments of the initial and 
secondary connections, respectively. The assortativity pattern of modeled 
networks show that the potentially violent VV network qualitatively resem- 
bles the connectivity pattern in terrorist networks reported in 20j. The 



assortativity behavior of NV network which is driven by aggression is clearly 
different than the assortativity pattern of A^A^ and VV networks which are 
non-aggressive relationships; a finding which is in agreement with the results 



of recent empirical study of an online social system [21 
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